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NEW HUMAN HEPATOMA-DERIVED GROWTH FACTOR ENCODING SEQUENCE 
AND POLYPEPTIDE ENCODED BY SUCH DNA SEQUENCE 
AND PRODUCING METHOD THEREOF 

5 Field of invention 

This invention relates to the field of genetic engineering, and, in particular, relates to the nucleotide 
sequence of a novel human gene. More particularly, this invention relates to the cDNA sequence of a novel 
type II human Hepatoma-derived Growth Factor (HDGF2), which is a homologue of type I HDGF. The 
invention also relates to the polypeptides encoded by the nucleotide sequence, the uses of these 
10 polynucleotides and polypeptides, and the methods for producing them. 

S Prior art 

It is revealed that the regulation of cell growth is mediated by a series of cascade reactions triggered by 
^ the interaction between a variety of cytokines and their specific receptors on membrane surfaces. In tumor 

yj 15 cells, some steps of cascade reactions appear to be out of control, which results in continuous cellular 
proliferation. In hepatoma cells, several autocrine and paracrine cell factors were found (Proc. Natl. Acad. 
Sci. 83:2448-2452, 1986; Proc. Natl. Acad. Sci. 86:7432-7436, 1989; Cell 61: 1137-1146, 1990). 
Q Hepatoma-derived Growth Factor (HDGF) was a cytokine identified from human hepatoma-derived cell 

rr line HuH-7 cultured in serum-free medium. HDGF had the heparin-binding activity and stimulated the 

01 20 DNA synthesis in Swiss 3T3 cell (J. Biol. chem. 269 (40): 25143-25149, 1994). 

In 1989, HDGF was first partially purified from HuH-7 cells and characterized by Nakamura et.al. 
(Clin. Chim. Acta. 183:273-284, 1989). This research group cloned the full length HDGF cDNA sequence 
in 1994 (J. Biol. Chem. 269(40): 25143-25149,1994). In 1997, this group found the mouse homologue of 
human HDGF as well as other two members of the gene family, HRP-1 and HRP-2. They all had a highly 
25 conserved N-terminal of 98 amino acids. (Biochem. Biophys. Res. Commun.238: 26-32,1997). 

Prior to this invention, none has disclosed human HDGF2 of the present application concerns, which is 
another member of the human HDGF family. 

Summary of Invention 

30 One purpose of the invention is to provide a new polynucleotide which encodes a homologue of HDGF. 

In the invention, the gene of said homologue of HDGF is named HDGF2. 

Another purpose of the invention is to provide a novel protein, which is named HDGF2. 
Still another purpose of the invention is to provide a new method for preparing said new HDGF2 
protein by recombinant techniques. 
35 The invention also relates to the uses of said HDGF2 protein and its coding sequence. 

In one aspect, the invention provides an isolated DNA molecule, which comprises a nucleotide 
sequence encoding a polypeptide having human HDGF2 protein activity, wherein said nucleotide sequence 
shares at least 70% homology to the nucleotide sequence of nucleotides 121-732 in SEQ ED NO: 3, or said 
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nucleotide sequence can hybridize to the nucleotide sequence of nucleotides 121-732 in SEQ ID NO: 3 
under moderate stringency. Preferably, said nucleotide sequence encodes a polypeptide comprising the 
amino acid sequence of SEQ ID NO: 4. More preferably, the sequence comprises the nucleotide sequence 
of nucleotides 121-732 m SEQ ID NO: 3. 
5 Further, the invention provides an isolated HDGF2 polypeptide, which comprises a polypeptide having 

the amino acid sequence of SEQ ID NO: 4, its active fragments, and its active derivatives. Preferably, the 
polypeptide is a polypeptide having the amino acid sequence of SEQ ID NO: 4. 

The invention also provides a vector comprising said isolated DNA. 

The invention further provides a host cell transformed with said vector. 
10 In another aspect, the invention provides a method for producing a polypeptide with the activity of 

HDGF2 protein, which comprises: 

(a) forming a HDGF2 protein expression vector comprising the nucleotide sequence encoding the 
polypeptide having the activity of HDGF2 protein, wherein said nucleotide sequence is operably linked 
with an expression regulatory sequences, and said nucleotide sequence shares at least 70% homology to the 

1 5 nucleotide sequence of positions 121-732 in SEQ ID NO: 3; 

(b) introducing the vector of step (a) into a host cell, thereby forming a recombinant cell of HDGF2 
protein; 

(c) culturing the recombinant cell of step (b) under the conditions suitable for the expression of 
HDGF2 polypeptides; 

20 (d) isolating the polypeptides having the activity of HDGF2 protein. 

In one embodiment of the present invention, the isolated polynucleotide has a full length of 1024 
nucleotides, whose detailed sequence is shown in SEQ ID NO: 3. The open reading frame (ORF) is located 
at nucleotides 121-732. 

25 In the present invention, the term "isolated" or "purified" or "substantially pure" DNA refers to a DNA 

or fragment which has been isolated from the sequences which frank it in a naturally occurring state. The 
term also applies to DNA or DNA fragment which has been isolated from other components naturally 
accompanying the nucleic acid and from proteins naturally accompanying it in the cell. 

In the present invention, the term "HDGF2 protein encoding sequence" or " HDGF2 polypeptide 

30 encoding sequence" refers to a nucleotide sequence encoding a polypeptide having the activity of HDGF2 
protein, such as the nucleotide sequence of positions 121-732 in SEQ ID NO: 3 or its degenerate sequence. 
The degenerate sequences means the sequences formed by replacing one or more codons in the ORF of 
121-732 in SEQ ID NO: 3 with degenerate codes which encode the same amino acid. Because of the 
degeneracy of codon, the sequence having a homology as low as about 70% to the sequence of nucleotides 

35 121-732 in SEQ ID NO: 3 can also encode the sequence shown in SEQ ID NO: 4. The term also refers to 
the nucleotide sequences that hybridize to the nucleotide sequence of nucleotides 121-732 in SEQ ID NO: 
3 under moderate stringency or preferably under high stringency. In addition, the term also refers to the 
sequences having a homology of at least 70%, preferably 80%, more preferably 90%) to the nucleotide 
sequence of nucleotides 121-732 in SEQ ID NO: 3. 
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The term also refers to variants of the sequence in SEQ ID NO: 3, which are capable of encoding a 
protein having the same function as human HDGF2 protein. These variants includes, but are not limited to, 
deletions, insertions and/or substitutions of several nucleotides (typically 1-90, preferably 1-60, more 
preferably 1-20, and most preferably 1-10) and additions of several nucleotides (typically less than 60, 
5 preferably 30, more preferably 10, most preferably 5) at 5' end and/or 3' end. 

In the present invention, "substantially pure" proteins or polypeptides refers to those which occupy at 
least 20%, preferably at least 50%, more preferably at least 80%, most preferably at least 90% of the total 
sample material (by wet weight or dry weight). Purity can be measured by any appropriate method, e.g., in 
the case of polypeptides by column chromatography, PAGE or HPLC analysis. A substantially purified 

1 0 polypeptides is essentially free of naturally associated components. 

In the present invention, the term "HDGF2 polypeptide" or "HDGF2 protein" refers to a polypeptide 
having the activity of HDGF2 protein comprising the amino acid sequence of SEQ ID NO: 4. The term also 
comprises the variants of said amino acid sequence which have the same function of human HDGF2. These 
variants include, but are not limited to, deletions, insertions and/or substitutions of several amino acids 

15 (typically 1-50, preferably 1-30, more preferably 1-20, most preferably 1-10), and addition of one or more 
amino acids (typically less than 20, preferably less than 10, more preferably less than 5) at C-terminal 
and/or N-terminal. For example, the protein functions are usually unchanged when an amino residue is 
substituted by a similar or analogous one. Further, the addition of one or several amino acids at C-terminal 
and/or N-terminal will not change the function of protein. The term also includes the active fragments and 

20 derivatives of HDGF2 protein. 

The variants of polypeptide include homologous sequences, allelic variants, natural mutants, induced 
mutants, proteins encoded by DNA which hybridizes to HDGF2 DNA under high or low stringency 
conditions as well as the polypeptides or proteins retrieved by antisera raised against HDGF2 polypeptide. 
The present mvention also provides other polypeptides, e.g., fusion proteins, which include the HDGF2 

25 polypeptide or fragments thereof. In addition to substantially full-length polypeptide, the soluble fragments 
of HDGF2 polypeptide are also included. Generally, these fragments comprise at least 10, typically at least 
30, preferably at least 50, more preferably at least 80, most preferably at least 100 consecutive amino acids 
of HDGF2 polypeptide. 

The present invention also provides the analogues of HDGF2 protein or polypeptide. Analogues can 
30 differ from naturally occurring HDGF2 polypeptide by amino acid sequence differences or by 
modifications which do not affect the sequence, or by both. These polypeptides include genetic variants, 
both natural and induced. Induced variants can be made by various techniques, e.g., by random mutagenesis 
using irradiation or exposure to mutagens, or by site-directed mutagenesis or other known molecular 
biologic techniques. Also included are analogues which include residues other than those naturally 
35 occurring L-amino acids ( e.g., D-amino acids) or non-naturally occurring or synthetic amino acids (e.g., 
beta- or gamma-amino acids). It is understood that the polypeptides of the invention are not limited to the 
representative polypeptides listed hereinabove. 

Modifications ( which do not normally alter primary sequence) include in vivo, or in vitro chemical 
derivation of polypeptides, e.g., acelylation, or carboxylation. Also included are modifications of 
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glycosylation, e.g., those made by modifying the glycosylation patterns of a polypeptide during its 
synthesis and processing or in the further processing steps, e.g., by exposing the polypeptide to enzymes 
which affect glycosylation (e.g., mammalian glycosylating or deglycosylating enzymes). Also included are 
sequences which have phosphorylated amino acid residues, e.g., phosphotyrosine, phosphoserine, 
5 phosphothronine, as well as sequences which have been modified to improve their resistance to proteolytic 
degradation or to optimize solubility properties. 

The invention also includes antisense sequence of the sequence encoding HDGF2 polypeptide. Said 
antisense sequence can be used to inhibit expression of HDGF2 in cells. 

The invention also includes probes, typically having 8-100, preferably 15-50 consecutive nucleotides. 
10 These probes can be used to detect the presence of nucleic acid molecules coding for HDGF2 in samples. 

The present invention also includes methods for detecting HDGF2 nucleotide sequences, which 
comprises hybridizing said probes to samples, and detecting the binding of the probes. Preferably, the 
samples are products of PGR amplification. The primers in PGR amplification correspond to coding 
sequence of HDGF2 polypeptide and are located at both ends or in the middle of the coding sequence. In 
1 5 general, the length of the primers is 20 to 50 nucleotides. 

A variety of vectors known in the art, such as those commercially available, are useful in the invention. 

In the invention, the term "host cells" includes prokaryotic and eukaryotic cells. The common 
prokaryotic host cells include Escherichi coli, Bacillus subtilis, and so on. The common eukaryotic host 
cells include yeast cells, insect cells, and mammalian cells. Preferably, the host cells are eukaryotic cells, 
20 e.g., CHO cells, COS cells, and the like. 

In another aspect, the invention also includes antibodies, preferably monoclonal antibodies, which are 
specific for polypeptides encoded by HDGF2 DNA or fragments thereof. By "specificity", it is meant an 
antibody which binds to the HDGF2 gene products or a fragments thereof. Preferably, the antibody binds to 
the HDGF2 gene products or a fragments thereof and does not substantially recognize nor bind to other 
25 antigenically unrelated molecules. Antibodies which bind to HDGF2 and block HDGF2 protein and those 
which do not affect the HDGF2 function are included in the invention. The invention also includes 
antibodies which bind to the HDGF2 gene product in its unmodified as well as modified form. 

The present invention includes not only intact monoclonal or polyclonal antibodies, but also 
immunologically-active antibody fragments, e.g., a Fab' or (Fab)2 fragment, an antibody light chain, an 
30 antibody heavy chain, a genetically engineered single chain Fv molecule (Lander, et al.,US Pat No. 
4,946,778), or a chimeric antibody, e.g., an antibody which contains the binding specificity of a murine 
antibody, but the remaining portion of which is of human origin. 

The antibodies in the present invention can be prepared by various techniques known to those skilled 
in the art. For example, purified HDGF2 gene products, or its antigenic fragments can be administrated to 
35 animals to induce the production of polyclonal antibodies. Similarly, cells expressing HDGF2 or its 
antigenic fragments can be used to immunize animals to produce antibodies. Antibodies of the invention 
can be monoclonal antibodies which can be prepared by using hybridoma technique (See Kohler, et al.. 
Nature, 256; 495,1975; Kohler, et al., Eur. J. Immunol. 6: 51 1,1976; Kohler, et al., Eur, J. Immunol. 6: 292, 
1976; Hammerling, et al., In Monoclonal Antibodies and T Cell Hybridomas, Elsevier, N.Y., 1981). 
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Antibodies of the invention comprise those which block HDGF2 function and those which do not affect 
HDGF2 function. Antibodies in the invention can be produced by routine immunology techniques and 
using fragments or functional regions of HDGF2 gene product. These fragments and functional regions can 
be prepared by recombinant methods or synthesized by a polypeptide synthesizer. Antibodies binding to 
5 unmodified HDGF2 gene product can be produced by immunizing animals with gene products produced by 
prokaryotic cells (e.g., E. coli); antibodies binding to post-translationally modified forms thereof can be 
acquired by immunizing animals with gene products produced by eukaryotic cells (e.g., yeast or insect 
cells). 

The full length human HDGF2 nucleotide sequence or its fragment of the invention can be prepared by 
10 PGR amplification, recombinant method and synthetic method. For PGR amplification, one can obtain said 
sequences by designing primers based on the nucleotide sequence disclosed in the invention, especially the 
y sequence of ORF, and using cDNA library commercially available or prepared by routine techniques 

z^i known in the art as a template. When the sequence is long, it is usually necessary to perform two or more 

03 PGR amplifications and link the amplified fragments together in the correct order. 

15 Once the sequence is obtained, a great amount of the sequences can be produced by recombinant 

flj methods. Usually, said sequence is cloned in a vector which is then transformed into a host cell. Then the 

'''^ sequence is isolated from the amplified host cells using conventional techniques. 

Q Further, the sequence can be produced by synthesis. Typically, several small fragments are synthesized 

and linked together to obtain a long sequence. At present, it is completely feasible to chemically synthesize 
ffi 20 the DNA sequence encoding the protein of the invention, or the fragments or derivatives thereof In 
tj addition, the mutation can be introduced into the sequence of the protein by chemical synthesis. 

In addition to recombinant techniques, the protein fragments of the invention may also be prepared by 
direct chemical synthesis using solid phase synthesis techniques (Stewart et al., (1969) Solid-Phase Peptide 
Synthesis, WH Freeman Go., San Francisco; Merrifield J. (1963), J. Am. Ghem. Assoc. 85: 2149-2154). Li 
25 vitro protein synthesis can be performed manually or automatically, e.g., using a Model 431 Peptide 
Synthesizer (Applied Biosystems, Foster Gity, GA). The fragments of protein of the invention can be 
synthesized separately and linked together using chemical methods so as to produce full-length molecule. 

The sequences encoding the protein of the present invention are also valuable for gene mapping. For 
example, the accurate chromosome mapping can be performed by hybridizing cDNA clones to a 
30 chromosome in metaphase. This technique can use cDNA as short as about 500bp, or as long as about 
2000bp, or more. For details, see Verma et al.. Human Ghromosomes: A Manual of Basic Techniques, 
Pergamon Press, New York (1988). 

Once a sequence has been mapped to a precise chromosomal location, the physical position of the 
sequence on the chromosome can be correlated with genetic map data. Such data are found in, e.g., 
35 Mendelian Inheritance in Man (available on-line through Johns Hopkins University Welch Medical 
Library). The relationships between genes and diseases that have been mapped to the same chromosomal 
region are then identified through linkage analysis. 

Then, the differences in the cDNA or genomic sequence between affected and unaffected individuals 
can also be determined. If a mutation is observed in some or all of the affected individuals but not in any 
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normal individual, then the mutation is likely to be the causative agent of the disease. 

The substances which act with the HDGF2, e.g., receptors, inhibitors and antagonists, can be screened 
out by various conventional techniques, using the protein of the invention. 

The protein, antibody, inhibitor, antagonist or receptor of the invention provide different effects when 
5 administrated in therapy. Usually, these substances are formulated with a non-toxic, inert and 
pharmaceutically acceptable aqueous carrier. The pH typically ranges from 5 to 8, preferably from about 6 
to 8, although pH may alter according to the property of the formulated substances and the diseases to be 
treated. The formulated pharmaceutical composition is administrated in conventional routine including, but 
not limited to, intramuscular, intraperitoneal, subcutaneous, intracutaneous, or topical administration. 
10 As an example, the human HDGF2 protein of the invention may be administrated together with the 

suitable and pharmaceutically acceptable carrier. The examples of carriers include, but are not limited to, 
saline, buffer solution, glucose, water, glycerin, ethanol, or the combination thereof. The pharmaceutical 
C; formulation should be suitable for the delivery method. The human HDGF2 protein of the invention may be 

U2 in the form of injections which are made by conventional methods, using physiological saline or other 

yl 1 5 aqueous solution containing glucose or auxiliary substances. The pharmaceutical compositions in the form 
flj of tablet or capsule may be prepared by routine methods. The pharmaceutical compositions, e.g., injections, 

solutions, tablets, and capsules, should be manufactured under sterile conditions. The active ingredient is 
p administrated in therapeutically effective amount, e.g., from about lug to 5mg per kg body weight per day. 

W Moreover, the polypeptide of the invention can be administrated together with other therapeutic agent. 

ffi 20 When the human HDGF2 polypeptides of the invention are used as a pharmaceutical, the 

therapeutically effective amount of the polypeptides are administrated to mammals. Typically, the 
therapeutically effective amount is at least about 10 ug/kg body weight and less than about 8 mg/kg body 
weight in most cases, and preferably about lOug-1 mg/kg body weight. Of course, the precise amount will 
depend upon various factors, such as delivery methods, the subject health, and the like, and is within the 
25 judgment of the skilled clinician. 

Description of Drawings 

Svib ^ shows an alignment comparison of nucleotide sequences of HDGF2 of the invention and mouse 

HDGF2. The identical nucled^tides are indicated by "|". 
30 Fig. 2 shows an alignmenr\comparison of amino acid sequences of HDGF2 of the invention and mouse 

HDGF2. The identical and similar amino acids are indicated by "|'' and respectively. 

In one embodiment, the cDN/\sequence of HDGF2 was obtained as follows: human testis ^ gt 1 1 
cDNA library (Clontech) was used ^s a template and PGR was carried out with the synthetic forward 
35 primer Al :5'-ACCGCTCGTC CGCCCGGCTT GAG-3' and reverse primer B :5'-GATCCTAGAC 
ATGTATAAGT CTGCGC-3'. Target fragments of 1024bp were obtained. The sequencing of the PGR 
product gave the full length cDNA sequenA^ shown in SEQ ID NO: 3. 

Hepatoma-derived Growth Factor (HDGF) is a hepatin-binding protein isolated from human hepatoma- 
derived cell line HuH-7. HDGF has the activity of stimulating cell growth and promoting the growth of 
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fibroblast and some heptoma cells (J.Biol.Chem. 269(40): 25143-25149, 1994). It is expressed in human 
heart, brain, lung, liver, etc., and several tumor-derived cell lines (J.Biol.Chem. 269(40): 25143-25149, 
1994). The expression patterns of the HDGF gene family members are different. However, they are all 
enriched in testis and the 5'-untranslated region contains GC-rich nucleotide sequences (GC content >70%) 
5 (Biochem. Biophys. Res. Comun. 238: 26-32, 1997), suggesting their potential important roles in male 
germ-cell development. They may also relate to DNA methylation, chromatin conformation, and 
translational regulation (J. Cell. Biol. 115: 887-903, 1990; Cell 62: 503-514, 1990). Although HDGF 
protein is located mainly in cytoplasm (J. Biol. Chem. 269(40): 25143-25149, 1994), the amino acid 
sequences of family members all contain a putative Nuclear Localization Signal (NLS), and none have any 

10 signal peptide sequence, which suggests they may play a role in nucleus. Furthermore, the acidic amino 
acid sequence in the C-terminus of HDGF shares a high homology to that of HMG-1/-2 of HMG family 
(This sequence is known to be a histone-binding region in HMG-1/-2) (Biochemistry 29: 4419-4423, 1990). 
It is likely that HDGF functions as a transcriptional factor to stimulate cell growth after internalization 
(Biochem. Biophys. Res. Comun. 238: 26-32, 1997). The mitogenic activity of HDGF implies the great 

15 application value of HDGF in treating pernicious oxyhepatitis and liver injury (Clin. Chim.Acta. 183L 273- 
284, 1989), Researches indicate that many fibroblast growth factors can be widely applied to the 
vascularization defects, i.e., ischemia and atherosclerosis, and to neuron development (Blood 91(10): 3527- 
3561, 1998; Ann. N. Y. Acad. Sci. 545: 240-252, 1998). 

20 The invention is further illustrated by the following examples. It is appreciated that these examples are 

only intended to illustrate the invention, but not to limit the scope of the invention. For the experimental 
methods in the following examples, they are performed under routine conditions, e.g., those described by 
Sambrook. et al., in Molecule Clone: A Laboratory Manual, New York: Cold Spring Harbor Laboratory 
Press, 1989, or as instructed by the manufacturers, unless otherwise specified. 

25 

Example 1 

The cloning and sequencing of HDGF2 cDNA sequence 

1. Amplification with primers 

The template was human testis ^ gt 1 1 cDNA library (commercially available from Clontech). PCR 
30 was carried out with the forward primer Al: 5'-ACCGCTCGTC CGCCCGGCTT GAG-3' (SEQ ID NO: 1) 
and reverse primer A2: 5'-GATCCTAGAC ATGTATAAGT CTGCGC-3' (SEQ ID NO: 2). The PCR 
condition was 4 mins at 93°C; followed by 35 cycles with 1 min at 93*'C, 1 min at 68.5*'C, and 1 min at 
72*^0; and, finally 5 mins at 72°C. The PCR fragments were detected by electrophoresis. The target 
fragment was 1024bp. 

35 

2. Sequencing PCR products 

The above obtained PCR products were linked with pGEM-T ™ vector (Promega) and transformed 
into E. coli JM103. The plasmids were extracted using QIAprep Plasmid Kit (QIAGEN). The oriented 
serial deletion of the inserted fragments was carried out with Double-Stranded Nested Deletion Kit 
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(Pharmacia), and the deletants were quickly identified by PCR and arranged in order. The deletants 
successively cut-off were sequenced with SequiTherm EXCEL™ DNA Sequencing Kit (Epicentre 
Technologies). A full length cDNA sequence of 1024bp was obtained by overlapping the sequences with 
computer software. The detailed sequence is shown in SEQ ID NO: 3 with an open reading frame (ORE) 
5 located at nucleotides 121-732. 

According to the resultant full-length cDNA sequence, the amino acid sequence of HDGF2 was 
deduced, having 203 amino acid residues totally. See SEQ ID NO: 4 for its amino acid sequence in details. 



The full length cDNA sequence of human HDGF2 and the encoded protein were used for homologous 
searching Non-redundant GenBank + EMBL + DDBJ + PDB and on-redundant GenBank CDS translations 
+ PDB + SwissProt + Spupdate + PIR databases by BLAST algorithm. The result showed that they shared 
high homology to mouse HDGF2 (dbj|D63707|MUSHDGF) gene and its encoded protein. Using 

15 PCGENE software, it was found that they shared 68.7% identity at the nucleic acid level and 53.7% 
identity and 9.4% similarity at the protein level (FIG.l and FIG. 2). In particular, the conserved 98-amino 
acid N-terminal showed 90% homology to mouse HDGF. In addition, human HDGF2 was homologous to 
another mouse HDGF gene (dbj|D63850|MUSHDGF) and another human HDGF gene 
(dbj|D16431|HUMHDGF). These abovementioned genes are regarded as a family. So the functions of the 

20 HDGF2 can be deduced from the known functions of these genes and proteins. 

Hepatoma-derived Growth Factor (HDGF) is a hepatin-binding protein isolated from human hepatoma- 
derived cell line HuH-7. HDGF has the activity of stimulating cell growth and promoting the growth of 
fibroblast and some heptoma cells (J.Biol.Chem. 269(40): 25143-25149, 1994). Though HDGF was firstly 
identified in hepatoma cells, Northern blotting analysis showed that it was expressed in human heart, brain, 

25 lung, liver, etc., and several tumor-derived cell lines (J.Biol.Chem. 269(40): 25143-25149, 1994). It needed 
further investigation to determine whether HDGF was expressed differently in normal cells and tumor cells 
(J.Biol.Chem. 269(40): 25143-25149, 1994). The functions of HDGF in hepatoma cells and its influence on 
hepatoma treatment would be revealed constantly as the researches were carried out. The expression 
patterns of the HDGF gene family members were different. However, they were all enriched in testis and 

30 the 5 '-untranslated region contained GC-rich nucleotide sequences (GC content >70%) (Biochem. Biophys. 
Res. Comun. 238: 26-32, 1997). This property was similar to genes specifically expressed in testis or 
embryonic development, suggesting the potential important roles in male germ-cell development. They 
might also relate to DNA methylation, chromatin conformation, and translational regulation (J. Cell. Biol. 
115: 887-903, 1990; Cell 62: 503-514, 1990). 

35 It was revealed by immunofluoresence test that HDGF protein was located mainly in cytoplasm (J. 

Biol. Chem. 269(40): 25143-25149, 1994). The amino acid sequences of family members all contained a 
putative Nuclear Localization Signal (NLS), and none had any signal peptide sequence, which suggested 
they may play a role as a nucleoprotein. Fibroblast Growth Factor (FGF) was located in nuclear by this 
signal sequence to exert its mitogenic activity. Furthermore, the acidic amino acid sequence in the C- 
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Example 2 

Homologous comparison 
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terminus of HDGF shared a high homology to that of HMG-1/-2 of HMG family and said sequence was 
known to be a histone-binding region in HMG-1/-2. (Biochemistry 29: 4419-4423, 1990). Summing up, it 
was hkely that HDGF functions as a transcriptional factor to stimulate cell growth after internalization 
(Biochem. Biophys. Res. Comun. 238: 26-32, 1997). HDGF2 of the invention had similar activity. 
5 The mitogenic activity of HDGF implied the great application value of HDGF in treating pernicious 

oxyhepatitis and liver injury (Clin. Chim.Acta. 183L 273-284, 1989). Researches indicated that many 
fibroblast growth factors were capable of promoting the growth of epithelial cells and could be widely 
applied to the vascularization defects, i.e., ischemia and atherosclerosis, and to neuron development (Blood 
91(10): 3527-3561, 1998; Ann. N. Y. Acad. Sci. 545: 240-252, 1998). The application of HDGF 1 and 
1 0 HDGF2 of the invention in promoting the growth of fibroblast needs further study. 

The HDGF2 of the invention can be used not only as a member of the family in the study of function, 
y but also to produce fusion proteins with other proteins, such as immunoglobulins. Besides, HDGF2 can be 

Cj fused with or exchange fragments with other members of the family to form new proteins. For example, the 

ffl N terminal of HDGF2 can exchange with the N terminal of HDGFl or mice HDGF to produce proteins 

. ~] 1 5 which are more active or have new properties. 

flj The antibodies against HDGF2 can be used to screen other members of the family or to purify the 

™ related proteins such as other members of the family through affinity purification. 

W Example 3 

20 Expression of HDGF2 in E. coli 

O In this example, the cDNA sequence encoding HDGF2 was amplified with oligonucleotide PGR 

primers corresponding to 5'- and 3'-end of said DNA sequence. The resultant HDGF2 cDNA was used as an 
insertion fragment. 

The sequence of 5'-end oligonucleotide primer was: 
25 5'-CCACGGATCCATGGCGCGTCCGCGGCCCC-3' (SEQ ID NO: 5). 

This primer contained a cleavage site of restriction endonuclease BamH I, followed by 19 nucleotides 
of HDGF2 coding sequence starting from the start codon. 
The sequence of 3 '-end primer was: 

5'-ATCCGTCGACTTAGGTCCCTTCACTGGTT-3'(SEQ ID NO: 6). 
30 This primer contained a cleavage site of restriction endonuclease Sail, a translation terminator and 

partial HDGF2 coding sequence. 

These cleavage sites of restriction endonuclease in primers corresponded to the cleavage sites in 
bacterial expression vector pQE-9 (Qiagen Inc., Chatsworth, CA). Vector pQE-9 encodes an antibiotic 
resistance (Amp"), a bacterial replication origin (ori), an IPTG-adjustable promotor/operon (P/O), a 
35 ribosome-binding site (RBS), a six-hisitine tag (6-His) and cloning sites of restriction endonuclease. 

Vector pQE-9 and insertion fragments were digested by BamHI and Sail, and then linked together, 
ensuring that the open reading frame started from the bacterial RBS. Then, the linkage mixture was used to 
transform E.coli M15/rep4 (Qiagen) containing multi-copy of plasmid pREP4 which expressed repressor of 
lad and was resistant to kanamycin (Kan*). Transformants were screened out in LB medium containing 
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Amp and Kan. The plasmids were extracted. The size and direction of the inserted fragments were verified 
by PstI digestion. The sequencing confirmed that HDGF2 cDNA fragment was correctly inserted into the 
vector. 

The positive clones of transformant were cultured overnight in LB liquid medium supplemented with 
5 Amp (lOOug/ml) and Kan (25ug/ml). The overnight culture was 1:100-1:250 diluted, inoculated into large 
volume medium, and cultured until the 600nm optical density (OD600) reached 0.4-0.6. IPTG 
(isopropylthio-beta-D-galactoside) was added to final concentration of ImM. By deactivating repressor of 
Lad, IPTG induced and promoted P/O, thereby increasing the expression of gene. The cells were cultured 
for another 3-4 hours, and then centrifuged (6000 X g, 20 mins). The cultures were sonicated, and cell 

10 lysate was collected and diluted with 6M guanidine hydrochloride. After clarification, the dissolved 
HDGF2 in solution were purified by nickel-chelated column chromatography under the conditions suitable 
for the tight binding of 6-His tagged protein and column. HDGF2 was eluted with 6M guanidine 
hydrochloride (pH 5.0). The denaturalized proteins in guanidine hydrochloride were precipitated by several 
methods. First, guanidine hydrochloride was separated by dialysis. Alternatively, the purified protein, 

15 which was isolated from nickel-chelated column, bound to the second column with decreased linear 
gradient of guanidine hydrochloride. The proteins were denatured when binding to the column. Then, the 
proteins were eluted with guanidine hydrochloride (pH 5.0). Finally, the soluble proteins were dialyzed 
with PBS, then preserved in glycerol stock solution with the final glycerol concentration of 10% (w/v). 
The molecular weight of the expressed protein was about 23 kDa, as identified by 12% SDS-PAGE. 

20 Moreover, the sequencing results of the 10 amino acids at the N- and C-terminal of the expressed 

protein indicated that they were identical to those in SEQ ID NO: 4. 

Example 4 

Expression of HDGF2 in eukaryotic cells (CHO cell line) 
25 In this example, the cDNA sequence encoding HDGF2 was amplified with oligonucleotide PGR 

primers corresponding to 5'- and 3 '-end of said DNA sequence. The resultant product was used as an 
insertion fragment. 

The sequence of 5'-end oligonucleotide primer was: 
5'- CCCTAAGCTTATGGCGCGTCCGCGGCCCC-3'(SEQ ID NO: 7), 
30 This primer contained a cleavage site of restriction endonuclease Hindlll, followed by 19 nucleotides 

of HDGF2 coding sequence starting from the start codon. 
The sequence of 3'-end primer was: 

5'-TTTCGGATCCTTAGGTCCCTTCACTGGTT-3' (SEQ ID NO: 8) 

This primer contained a cleavage site of restriction endonuclease BamHI, a translation stop codon, and 
35 partial HDGF2 coding sequence. 

These cleavage sites of restriction endonuclease in primers corresponded to the cleavage sites in 
expression vector pcDNA3 for CHO cell. This vector encoded two kinds of antibiotic resistance (Amp'^and 
Neo*), a phage replication origin (fl ori), a virus replication origin (SV40 ori), a T7 promoter, a virus 
promoter (P-CMV), a Sp6 promoter, a polyadenylation signal of SV40 and the corresponding polyA 
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sequence thereof, a polyadenylation signal of BGH and the corresponding poly A sequence thereof. 

The vector pcDNA3 and insertion fragment were digested with Hindlll and BamHI, and linked 
together. Subsequently, E.coli strand DH5 a was transformed with linkage mixture. Transformants were 
screened out in LB medium containing Amp. The clones containing the needed constructs were cultured 
5 overnight in LB liquid medium supplemented with Amp (100 ug/ml). Plasmids were extracted. The size 
and direction of the inserted fragments were verified by PstI digestion. The sequencing indicated that 
HDGF2 cDNA fragment was correctly inserted into the vector. 

Plasmids were transfected into CHO cells by lipofection with Lipofectin Kit (GIBco Life). After 
transfecting the cells for 48 hours and screening the cells with G418 for 2-3 weeks, the cells and cell 

10 supernatant were collected and the activity of the expressed protein was measured. G418 was removed and 
the transformants were subcultured continuously. The mixed clonal cells were limiting diluted and the 
subclones with higher protein activity were selected. The positive subclones were mass cultured by routine 
methods. 48 hours later, the cells and supernatant were collected. The cells were ultrasonicated. Using 
50mM Tris-HCl (pH7.6) solution containing 0.05% Triton as an equilibrium solution and eluent, the active 

1 5 peek of the protein was collected with a pre-balanced Superdex G-75 column. Then, using 50mM Tris-HCl 
(pH8.0) solution containing 0-1 M NaCl as an eluent, the protein was gradiently washed on a DEAE- 
Sepharose column balanced with 50mM Tris-HCl (pH8.0) solution. The active peek of the protein was 
collected. The solution of the expressed protein was dialyzed with PBS (pH7.4), and finally lyophilized and 
preserved. 

20 The molecular weight of the expressed protein was about 23 kDa as identified by 12% SDS-PAGE. 

Moreover, the sequencing results of the 10 amino acids at the N- and C-terminal of the expressed 
protein indicated that they were identical to those in SEQ ED NO: 4. 

Example 5 

2 5 Antibody preparation 

Antibodies were produced by immunizing animals with the recombinant proteins obtained in Examples 
3 and 4. The method was as follows: the recombinant proteins were isolated by chromatography, and stored 
for use. Alternatively, the protein was isolated by SDS-PAGE electrophoresis, and obtained by cutting 
eletrophoretic bands from gel. The protein was emulsified with Freund's complete adjuvant of the same 

30 volume. The emulsified protein was injected intraperitoneally into mice at a dosage of 50-100ug/0.2ml. 14 
days later, the same antigen was emulsified with Freund's incomplete adjuvant and injected 
intraperitoneally into mice at a dosage of 50-100ug/0.2ml for booster immunization. Booster immunization 
was carried out every 14 days, for at least three times. The specific activity of the obtained antiserum was 
evaluated by its ability of precipitating the translation product of HDGF2 gene in vitro. 

35 

All the documents cited herein are incorporated into the invention as reference, as if each of them is 
individually incorporated. Further, it is appreciated that, in the above teaching of the invention, the skilled 
in the art can make certain changes or modifications to the invention, and these equivalents are still within 
the scope of the invention defined by the appended claims of the present application. 
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